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The AMU gene is an essential transcription factor regulating 
the differentiation of hematopoietic stem cells into mature 
blood cells. Though at least 12 different alternatively spliced 
AMU mRNAs are generated, three splice variants {AMU a, 
AMUb and AMUc) have been characterized. Here, using the 
reverse transcription-polymerase chain reaction with outward- 
facing primers, we identified a novel non-polyadenylated 
transcript from the AMU gene, with exons 5 and 6 scrambled. 
The novel transcript resisted RNase R digestion, indicating it is 
a circular RNA structure that may originate from products of 
mRNA alternative splicing. The expression of the novel 
transcript in different cells or cell lines of human and a 
number of other species matched those of the canonical 
transcripts. The discovery provides additional evidence that 
circular RNA could stably exist in vivo in human, and may 
also help to understand the mechanism of the regulation of the 
AMU gene transcription. [BMB Reports 2013; 46(3): 163-168] 



INTRODUCTION 

The Runt-related transcription factor 1 gene {RUNX-1 alias 
AMU), located on chromosome 21q22, consists of 12 exons 
and spans 261 kb of genomic DNA. It plays an important role 
in establishing hematopoiesis and the generation of hema- 
topoietic stem cells in the embryo (1). Transcription of AMU 
is initiated at two distinct 5' regions, a distal region-promoter-1 
(P1) and a proximal region-promoter-2 (P2). The two pro- 
moters generate at least 12 alternatively spliced mRNAs, 
which differ within the 5' and 3' untranslated regions (UTR) 
and coding regions. Among these variants, the predominant 
transcripts AMUb and AMUc code for a 453-amino acid pro- 
tein and a 480-amino acid protein, respectively, while the mi- 
nor AMU a codes for a protein, which has a higher affinity for 
DNA binding and suppresses the transcriptional activation of 
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AMUb. To date, 55 partner chromosome bands and 21 trans- 
locations have been described at the molecular level, which 
are considered as contributors to leukemia, especially acute 
myeloid leukemia (AML), using different pathways (1). 

In our previous study, we used a pair of outward-facing pri- 
mers to identify a novel partner gene of AMU in an acute 
myeloid leukemia patient. We randomly selected 3 healthy in- 
dividuals as negative control, and to our surprise, three un- 
expected bands of the same size could be amplified in all 
samples. By sequencing, we accidentally discovered the se- 
quence of the exons was arranged in a different order com- 
pared to the genomic DNA, which consisted of a 5-6-5 exon 
alignment. These findings raise several fundamental questions: 
Where does it originate from? What does its whole structure 
look like? 

In this study, we have presented evidence from RT-PCR us- 
ing outward-facing primers that further examine its presence. 
Also, through several experiments, we investigated its 
structure. Finally, the expression of this transcript in a number 
of human cell types, and several other species was studied. 

RESULTS 

AMU produces the abnormal transcript with the 5-6-5 exon 
alignment 

To examine the abnormal transcript consisting of the 5-6-5 
exon alignment, we performed PCR with total RNA from bone 
marrow or peripheral blood of 10 AML patients and 10 
healthy individuals using a pair of outward-facing primers 
(oF/oR) from exon 5 (as in GenBank D43969). Fragments were 
observed in all samples at approximately 305 bp, and absent 
in the negative and blank controls. It was noteworthy that the 
305 bp fragment contained an abnormal exon 5 following the 
original exons 5 and 6 (Fig. 1A). 

To further examine whether the product with the abnormal 
exon order was an artifact, we used additional pairs of primers 
(6F/5R2 and 5F2/5R1). Also, primers F65 and R65 were de- 
signed to cover the boundary of the exons 6-5 and these were 
used in further PCRs with the primer pairs F65/5R3 and 
5F1/R65. We confirmed that these amplified products con- 
tained the expected sequences (Fig. 1 B). 
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Fig. 1. Analysis of RT-PCR results. (A) Detection of the novel AMU transcript with the exon 5-6-5 alignment using RT-PCR and sequence 
analysis. Lanes 1 and 4 show the amplification products of the novel transcript in an AML patient and a healthy individual respectively, 
Lanes 2 and 5 are negative control, and Lanes 3 and 6 are blank control. The band above 305 bp in Lane 4 is nonspecific amplification 
product. The positions and directions of outward-facing primers (oF and oR) in the cDNA structure are indicated by arrows. The sizes of 
the expected fragment and the junction sequence of the 3' end of exon 6 and 5' end of exon 5 are also shown. (B) Detection of exon 
5-6-5 alignment using different pairs of primers. Positions and directions of different pairs in the cDNA structure are indicated by arrows. 
The amplification products of primers 6F/5R2, 5F2/5R1, F65/5R3 and 5F1/R65 are shown. By sequencing, the bands of 118 bp and 284 
bp in Lanes 1 and 4 and the bands of 116 bp and 315 bp in Lanes 7 and 10 respectively are the products, Lanes 2, 5, 8 and 11 are 
negative controls, Lanes 3, 6, 9 and 12 are blank controls. (C) Analysis of RT-PCR results and schematic diagram of the positions and di- 
rections of primers used to seek the neighboring exons of the exon 5-6-5 alignment. Lanes 1 to 4 are products of RT-PCR using 3F/7aR, 
3F/7bR, 4F/7aR and 4F/7bR, respectively. Asterisks show the canonical transcripts without exon 6. Lanes 5 to 8 are products of 3F/R65, 
4F/R65, F65/7aR and F65/7bR, respectively. Lane 9 shows the amplification product of F65/6R, Lane 10 is negative control. 



Identification of the neighboring sequences upstream and 
downstream of the 5-6-5 exon alignment 

To explore whether the novel transcript possesses the same 
neighboring sequences upstream and downstream as the can- 
onical sequences, PCR was performed using a further set of 
primers. The fragment could be amplified using primers 
F65/6R (Fig. 1C), while the expected band was not present us- 
ing the primers 3F/R65, 4F/R65, F65/7aR and F65/7bR. The 
canonical transcripts AM/Ja and AMLIh were successfully 
amplified using 3F/7aR, 3F/7bR, 4F/7aR and 4F/7bR (Fig. 1Q. 
Hence, we concluded that the 5-6-5 alignment was followed 
with an exon 6, but it did not contain exons 3 or 4 upstream or 
exons 7a or 7b downstream. 



Poly(A) tail is absent in the exons 5 and 6 scrambled AML1 
molecule 

We performed Rapid Amplification of cDNA Ends (RACE) to 
obtain either the 3' or 5' end of the novel transcript, but were 
unable to form any fragments (data not shown). To investigate 
whether a polyadenylated tail was present in this scrambled 
exon alignment, total RNA was reverse-transcribed with either 
random primers or oligo(dT) primers. PCR was then performed 
using the primers shown in Table 1 . Only the random primers, 
and not the oligo(dT) primers, produced the scrambled tran- 
script (Fig. 2A). Also, poly(A)+ and poly(A)- RNA was pre- 
pared from total RNA, and the quantity of poly(A)+ and 
poly(A)- RNA were equal when reverse transcribed using ran- 
dom primers. Canonical transcripts were amplified to testify 
the purity of the three samples using the primers 4F/5R1 . As a 
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result of amplification using the different pairs of primers 
shown in Table 1, the expected bands were present in the total 
RNA and poly(A)- RNA Lanes while absent in the poly(A) + 
Lane (Fig. 2B). 

The novel AMLt transcript is probably circular 

As the transcript lacked the poly(A) tail, we used the One Step 
PrimeScripf 8 miRNA cDNA Synthesis Kit (Takara, Catalog No. 
D350A) to add a segment of polyadenylate to it when it was 
reverse transcribed, allowing its 3' end to be amplified using 
our special forward primer and the Uni-miR qPCR Primer 
which is reverse to the adaptor sequence supplied in the kit. 
We were unable to add a tail to the novel transcript, while the 
reference gene U6 formed the expected mono peak after re- 
al-time PCR and the expected band on an agarose gel after 
conventional PCR (data not shown). 

To further testify the conformation of the non-canonical tran- 
script, we determined its sensitivity to the RNA exonuclease R 
(RNAse R), which specifically digests linear RNA, both struc- 
tured and non-structured, but does not digest circle or lariat 
RNA (2, 3). We performed real-time PCR to detect the change 
of its expression after RNase R digestion using cDNAs of five 
healthy individuals. The results showed a significant increase 
in the circular transcript. The result of calculation using the 
equation 2" AQ showed there was on average a 282-fold in- 
crease (Table 2), confirming its circular nature. 

The expression of the novel transcript in different cells or cell 
lines of human and several other species matched those of the 
canonical transcripts 

To determine the ubiquity of the novel transcript, we em- 
ployed RT-PCR using cDNAs from different cells or cell lines 
of human and several other species (Fig. 2C). It turned out that 
the novel circular transcript of AMU existed not in lower or- 
ganisms or plant, but in higher animals we selected. 



Table 1. Nucleotide sequence of primers 



Primer 


Sequence 


3F 


5'-TGCCCATCGCTTTCAAGGTG-3' 


4F 


5'-ACCGCAGCCATGAAGAACCAGGTTG-3' 


5F1 


5'-GGAAAAGCTTCACTCTGACCATC-3' 


5F2 


5'-CGCCACCTACCACAGAGCCATCA-3' 


oF" 


5'-GTCTTCACAAACCCACCG-3' 


oR d 


5'-GTGAAGACAGTGATGGTCAG-3' 


5R 


5'-GATGGCTCTGTGGTAGGTGGCGACT-3' 


6F 


5'-AACCACTCCACTGCCTTTAAC-3' 


6R1 


5'-CAGTGGAGTGGTTCAGGGAG-3' 


6R2 


5'-CTGCATCTGACTCTGAGGCTG-3' 


F65 


5'-CTCAGAGTCAGATGCAGGGGA-3' 


R65 


5'-GTCAGAGTGAAGC I I I ICCCCT-3' 


7aR 


5'-ATGCTTCCCATGTGCTGTGC-3' 


7bR 


5'-CAGGTATTGGTAGGACTGATCGTAG-3' 



a Primers oF and oR are designed outward facing. 
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DISCUSSION 

In this manuscript, we have successfully identified a novel 
AML 1 transcript containing the donor site of exon 6 linked to 
the acceptor site of exon 5. Our investigations have gained an 
insight into this novel transcript, characterizing its sequence 
and conjecturing on its structure. The observation of the novel 
transcript prompted experimental analysis of several possible 
interpretations of this phenomenon. Originally, this transcript 
was thought to arise from either the expression of a duplicate 
exon 5 to exon 6 genomic segment or an intermolecular splic- 
ing (trans-splicing) of two AML 7 pre-mRNAs. When reverse 



A 



oF&oR F65&6R 5F1&R6S 4F&5RJ 
Random + + + + 

OBgo(dT) + + + + 




Fig. 2. Observation of poly(A) tail in the novel transcript and ex- 
pression of the novel transcript in human normal cells, cancer cell 
lines and several other species. (A) RT-PCR results. 1 ug total RNA 
was subjected to reverse transcription using either random primers 
or oligo(dT) primer and amplified by PCR using three pairs of pri- 
mers oF/oR, F65/6R, and 5F1/R65. The canonical transcript was am- 
plified using 4F/5R1. (B) RT-PCR results of total RNA, poly(A) + 
RNA and poly(A)- RNA. Canonical transcripts were amplified using 
primers 4F/5R1 (shown on the left). Results of PCR amplifying the 
novel transcript are shown on the right. (Q Lines 1 to 2 respectively 
represent the amplification product of Oral mucosa and nucleus 
pulposus cells of an intervertebral disc using primers 5F1/R65; Lines 
3 to 7 respectively represent U266, K562, SMMC-7721, AGS and 
rat NRK52E cells; Line 8 represents tetrahymena; Lines 9 to 10 re- 
spectively represent Euplotes elegans and Tartary buckwheat. The 
positive control represents cDNA from bone marrow of healthy in- 
dividuals, followed by the negative control. 
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Table 2. The enrichment of the novel transcript after RNase R digestion 



CT value 



Sample R + a R- a 2" At Mean + standard deviation 



Gene ABL1 Gene ABL1 



1 19.1111 18.1087 27.3767 17.1736 588.4190 

2 29.3284 20.1606 35.2505 19.0501 130.9253 

3 20.336 19.6205 26.3639 18.8059 114.7619 282.6934 ± 207.8469 

4 24.9551 21.1963 32.4336 20.4433 300.5581 

5 20.9352 19.1885 28.8849 19.0151 278.8026 



a R+ and R- represent the groups with or without RNase R digestion respectively. 



transcriptase was omitted from the protocol, the product was 
not detected, as shown in Lane 2 in Fig. 1 A, indicating that the 
amplified products did not result from contamination of the ge- 
nomic DNA. We also excluded the possibility that it was pro- 
duced by the self-amplification of a single primer (data not 
shown). Additionally, the novel transcript was non-poly- 
adenylated, and resisted RNase R digestion, a 3'-5' exoribonu- 
clease that digests linear RNA. These results indicated that the 
novel AMU transcript with scrambled exons was a stable, nat- 
urally occurring, circular RNA species, as previously reported 
for other human genes. (2, 4-13). 

Miyoshi et a/. (4) identified three authentic transcripts of 
AMU using Northern blotting, but did not identify any circular 
forms. Our group also performed Northern blotting and we 
were also unable to detect the circular transcript (data not 
shown). The following two explanations may account for the 
negative result of the Northern blotting: (i) the content of the cir- 
cular transcript is indeed insufficient to be detected by Northern 
blotting; (ii) circular RNA possesses unique secondary structures 
to give it stability in the cytoplasm, leading to an obstacle in de- 
tection via probe labeling. At present, the only techniques to 
identify that a molecule is circular RNA, are the results of 
RT-PCR using outward-facing primers and RNase R digestion. If 
a molecule is linear, it could not exhibit enrichment after RNase 
R digestion and the value of 2" AQ is approximately 1 . However, 
by detecting the change of the novel AMU transcript after 
RNase R treatment in five healthy individuals using SYBR Green 
based real-time PCR, we observed that there was approximately 
a 282-fold increase on average (Table 2). 

Circular RNA, first reported in Viroids (14), has a particular 
structure where the 3' donor splice site is joined precisely to 
the incongruous 5' acceptor splice site. This type of structure 
did not attract great attention from researchers until circular 
transcripts were identified from the tumor suppressor gene, 
DCC, in rodent and human cell lines (5). Eukaryotic circular 
RNA was then reported in the human ets-7 gene (6, 7) and 
subsequently in the mouse testis-determining gene, sry (8), the 
rat cytochrome P-450 2C24 gene (9), the human cytochrome 
P-450 2C18 gene (10), the Na/Ca exchanger gene 1 (12), the 



dystrophin gene (11, 13) and the /NrC4/ARF-associated non- 
coding RNA (2). Noteworthily, recently Salzman et a/., by 
means of RNA-seq, has found circular transcripts of hundreds 
of human genes in diverse cell types (15). Here, we report a 
novel circularized transcript from the AMU gene, and provide 
additional evidence that circular RNA exists in humans. 

Hitherto, it was reported that circular RNA may consist of 
several contiguous exons (2, 9-1 1), exons not in succession or 
only a single exon (8, 10, 12, 13). The intriguing mechanism 
of circularization of scrambled exons was inferred to be a spli- 
ceosome-mediated alternative splicing event. It seems reason- 
able to suppose that they may simply represent errors during 
the normal splicing process, namely mis-splicing (7), or lariat 
splicing and inverse splicing (9), or trans-splicing (10), though 
in v/Vo-generated trans-spliced molecules are always poly- 
adenylated as are canonical mRNAs (16, 17). Exon skipping 
was considered to be associated with the formation of circular 
transcripts (10), by using 'molecular patches' to mask the skip- 
ped exon. We propose that there exists a particular structure 
which favors RNA circularization, a nuclease that recognizes 
this structure and a ligase that bridges the splice sites. It might 
be similar to the process of the generation of microRNA, 
which possess a hairpin structure, in which inverted repeat 
transcripts are precisely cleaved by RNase III enzyme(s) in the 
Dicer and/or Drosha protein families (18). 

What does the circular species exist for? Is it trash or treas- 
ure? Since circular transcripts are reported from different 
genes, and are stable in the cytoplasm (1 , 7, 8, 1 0), we support 
the hypothesis that they have a biological function beyond the 
splicing process. Chen and Sarnow (19) have demonstrated 
that if circular transcripts contain an internal ribosome entry 
site (IRES) element, they can be translated into proteins. 
Moreover, the circular NCX7 exon 2 transcript was deduced to 
encode a truncated 602-amino acid protein (12), though the ef- 
ficiency of translation might be affected by its circular structure 
as well as by the lack of a cap or poly(A) tail (20, 21). 
However, the majority of circular transcripts, like the circular 
AMU transcript, does not possess internal initiation sites for 
translation and probably do not code for a protein. 
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The non-coding characteristic of the circular RNA species 
are reminiscent of the group of non-coding RNAs, which are 
also products of the splicing process and challenge the central 
dogma. Recent studies indicate that members of this non-cod- 
ing RNA family are far more abundant and important than con- 
sidered initially (22). What is the relationship between circular 
RNAs and non-coding RNAs? Could we suppose that these cir- 
cular RNA transcripts belong to the non-coding RNA species? 
Since the discovery of microRNA and the reporting of their 
meaningful function, more attention has been paid to these 
non-canonical mRNAs. 

Whether the circular transcripts encode protein or not, study 
on their biological function has been going through a bottle- 
neck period. To date, due to the limited approach and technol- 
ogy, there is no way to figure out what physiological or patho- 
logical functions circular RNA may have. Therefore, it still re- 
quires us to seek new approach or develop new technology to 
further disclose the mysterious mask of these circular tran- 
scripts in the future. 

In summary, we identified a novel transcript from the AMU 
gene, which is likely to be a circular RNA molecule. We con- 
cluded that the novel circular transcript of AML1 is just con- 
sisted of exon5 and exon6. In other words, in the novel tran- 
script exon5 and exon6 were joint head to tail. Therefore, we 
inferred its full length is length of exon5 plus that of exon6, 
that is 297 bp. The novel transcript was identified in different 
cells or cell lines of human and rat cells we selected, but not 
in lower organisms. The circular RNA transcripts may suggest 
a novel RNA species, which may help to integrate the ge- 
nomic context. But this still leads to the debate about their un- 
known biological role. They should attract our attention, just 
like the non-coding RNA. We believe there is likely to be cir- 
cular RNA transcripts of other genes, though there is yet no di- 
rect method to detect them. Moreover, the specific mechanism 
of their formation also needs to be elucidated. 

MATERIALS AND METHODS 
Sample preparation 

Bone marrow samples were from healthy individuals and AML 
patients, who were diagnosed using the MICM classification 
standard. Oral mucosa and nucleus pulposus cells of an inter- 
vertebral disc were also obtained from healthy individuals. All 
samples were collected with informed consent from the 
subjects. The patients' IRB number is NO. 2012057 from 
Shanxi Medical University Medical Ethics Committee. 

U266, K562, SMMC-7721, AGS and NRK52E cells were de- 
posited in our laboratory, and cDNAs of tetrahymena, Euplotes 
elegans and Tartary buckwheat were given by Dr. Shen Quan. 

Preparation of total RNA, poly(A) + and poly(A)-RNA 

Total RNA was extracted using RNAiso Plus (Takara, Catalog*- 
D9108A) according to the instruction of manufacturer. The 
mRNA was isolated from total RNA using PolyATtracf *' mRNA 



Isolation Systems III (Promega, Cat#Z5300), and the poly(A)- 
RNA was recycled from the supernatant by ethanol precipi- 
tation. Purity and concentration were detected by NanoDrop 
2000 (Thermo, USA) before being stored at -80°C. 

Reverse transcription-polymerase chain reaction (RT-PCR) 

RNA (1 u,g) was in vitro reverse-transcribed into cDNA with re- 
verse transcriptase M-MLV (RNase H-) (Takara, Tokyo, Japan, 
Cat#D2640A) using random primers (Promega, Madison, Ml, 
USA) and an oligo(dT) primer (Takara, Cat#D511), respec- 
tively. As a negative control, cDNA prepared without reverse 
transcriptase was employed. 

The PCR was performed with a set of outward-facing pri- 
mers designed to bind in exon 5, oF and oR. Amplification 
conditions were as follows: denaturation at 94°C for 3 min; 35 
cycles of denaturation at 94°C for 30 s, annealing at 53°C for 
30 s, and extension at 72°C for 30 s; plus a final extension step 
at 72°C for 10 min. PCR was also employed with other sets of 
primers, using 60°C as the annealing temperature. The primers 
were designed by Primer 5.0 (Table 1). 

Each 25 ul PCR reaction contained 1 ul cDNA template, 2.5 
ul of 10x Taq buffer, 1.5 ul of 25 mM MgCI 2 , 0.5 uJ of 10 
mM dNTPs, 1.5 U of Taq DNA polymerase (Fermentas, 
Cat#EP0282) and 1 u.mol/L of forward and reverse primers. 
Aliquots (10 uJ) of the amplified PCR products were loaded 
onto 2% agarose gels for electrophoresis and visualization 
with ethidium bromide. The band was cut from the gel, puri- 
fied using the Gel Extraction Kit (Omega, Cat#D2500-01), and 
sequenced by the Beijing Aucg Biotechnology Company. 

RNase R digestion 

RNA (20 u,g) was incubated with or without 40 U of RNase R 
(Epicentre Biotechnologies, Cat* RNR07250) for 2 h at 37°C. 
The resulting RNA was purified by ethanol precipitation and 
quantified. Equal amounts of RNA with or without RNase R treat- 
ment were subjected to reverse transcription using random 
primers. Expression of the novel transcript was detected using 
SYBR Green-based real-time PCR, which was performed using 
the 7300 Real-Time PCR System (Applied Biosystems, USA) with 
SYBR Green I detection and Tm analysis. SYBR® Premix Ex Taq™ 
II (perfect real time) kit (Cat#DRR081A) was purchased from 
Takara. The optimized reaction was performed in a 25 uJ final re- 
action volume containing 12.5 ul of kit-supplied SYBR v8) Premix 
Ex Taq™ II, 1 uJ ROX Reference Dye (50x), 0.5 ul of both the 
forward and reverse primers (each 10 uM), 2 u.1 cDNA template, 
and 8.5 u.1 distilled water giving a final volume of 25 u,l. 
Amplification conditions were as follows: 95°C for 30 s, followed 
by 40 cycles of 95°C for 5 s, 60°C for 31 s with a dissociation 
step. The equation 2" AQ was used to measure the change of the 
novel transcript expression after RNase R digestion. 
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